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CONTINUED FRACTION EXPANSIONS IN CONNECTION WITH THE METRIC 

MAHLER MEASURE 

CHARLES L. SAMUELS 


Abstract. The metric Mahler measure was first studied by Dubickas and Smyth in 2001 as a means 
of phrasing Lehmer’s conjecture in topological language. More recent work of the author examined a 
parametrized family of generalized metric Mahler measures that gives rise to a series of new, and apparently 
difficult, problems. We establish a connection between these metric Mahler measures and the theory of 
continued fractions in a certain class of special cases. Our results enable us to calculate metric Mahler 
measures in several new examples. 


1. Introduction 


Suppose that I\ is a number field and v is a place of K dividing the place p of Q. Let K v and Q p be their 
respective completions so that K v is a finite extension of Q p . We note the well-known fact that 


v\p 


= [K: 


where the sum is taken over all places v of K dividing p. Given x £ K., 

and set 


we define ||x|| t , to be the unique 


extension of the p-adic absolute value on 1 

(i.i) = \\x\\Mmi 

If a € K , then a £ K v for every place v, so we may define the (logarithmic) Weil height by 

Ha) = X log+ 


a 1 


Due to our normalization of absolute values (1.1), this definition is independent of K, meaning that h is 
well-defined as a function on the multiplicative group Q of non-zero algebraic numbers. 

It is well-known that h(a ) = 0 if and only if a is a root of unity, and it can easily be verified that 
h(a n ) = \n\-h(a) for all integers n. In particular, we see that h(a) = h(a -1 ). If a £ Q x there exists relatively 
prime integers r and s such that a = r/s. Under these assumptions, we have that h(a) = logmax{|r|, |s|}. 

The Weil height is closely connected to a famous 1933 problem of D.H. Lehmer |s]. The (logarithmic) 
Mahler measure of a non-zero algebraic number a is defined by 


( 1 . 2 ) 


m(a) = [Q(a) : Q] • h(a). 


In attempting to construct large prime numbers, Lehmer came across the problem of determining whether 
there exists a sequence of algebraic numbers {a n }, not roots of unity, such that m(a n ) tends to 0 as 
n —> oo. This problem remains unresolved, although substantial evidence suggests that no such sequence 
exists (see [l,9 T5l 16], for instance). This assertion is typically called Lehmer’s conjecture. 


Conjecture 1.1 (Lehmer’s Conjecture). There exists c > 0 such that m(a) > c whenever a £ 
root of unity. 


is not a 


Dobrowolski [2] provided the best known lower bound on m(a) in terms of deg a, while Voutier 17 later 
gave a version of this result with an effective constant. Nevertheless, only little progress has been made on 
Lehmer’s conjecture for an arbitrary algebraic number a. 
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Dubickas and Smyth [3j[4j were the first to study a modified version of the Mahler measure that has the 
triangle inequality. A point (aq, a 2 , ■ • ■, ajv) £ (Q ) N is called a product representation of a if a = Yl n =i a ' n -’ 
and we write V(a) to denote the set of all product representations of a. Dubickas and Smyth defined the 
metric Mahler measure by 


N 


mi(a) = inf E m(a n ) : (aq, a 2 , • • •, un) £ V(a) > . 


It is verified in ( 4 ] that (a, /?) i-> mi(a/3 _1 ) is a well-defined metric on Q X /Q* ors which induces the discrete 
topology if and only if Lehmer’s conjecture is true. The author 11 -13 extended the metric Mahler measure 
to the t-metric Mahler measure 

\ 1/t 

(1.3) m t 


I / N \ 

(a) = inf < ^ m(a n ) t : (aq, a 2 , • • •, ot N ) £ V(a) 


which is well-defined for all t > 0. In this context, we examined the function 1 1 —> mt(a) for a fixed algebraic 
number a. For instance, we showed that this function is everywhere continuous and infinitely differentiable 
at all but finitely many points. 

As V{a) is infinite, if one is studying m t (a), it is often helpful to be able to replace V(a) with a finite set 
in O- A finite subset X{a) C V(a) is called an infimum set for a if 


nit(a) = min 


N 


1/t 


Y. m(a n f 


(aq,a 2 ,"- ,aqv) £ X(a) 


for all t > 0. As part of an article examining general metric heights, the author 13 established the following. 


Theorem 1.2 (S., 2014). Every algebraic number has an infimum set. 


In view of this result, we may conclude that t 1 —> m t (a) is a piecewise function with finitely many pieces 
each having the form 

/ n \ 1 / t 

t f^m(a„)M 

where (aq, a 2 ,..., ajv) £ 'P(a). In particular, there exists a product representation of a which attains the 
infimum in mt(cx) for all sufficiently large t. 


While Theorem 1.2 has these useful consequences, it also has a key weakness - it provides no method for 
constructing a particular infimum set for a. The earlier work of Jankauskas and the author |6|, although it 
applies only to the special case a £ Q, does not suffer from this same disadvantage. 

Suppose without loss of generality that a is a positive rational number with a / 1. A point 

ri r2 rN \ r-'nt \ 

1 ? * * * ; I £ ^(*M 

Sl s 2 Sn J 


is called a factorization of a if 

(i) Di, s n £ N for all 1 < n < N 

(ii) r n /s n ^ 1 for all 1 < n < N 

(iii) gcd(r m , s n ) = 1 for all 1 < m,n < IV 

If we further assume that r and s are relatively prime positive integers with a = r/s, then 


N N 

r =\\_r n and s = ]^[ s n . 

n—1 n =1 


This implies that the set J-(a) of factorizations of a is finite and we obtain the following result. 


Theorem 1.3 (Jankauskas & S., 2012). If a is a positive rational number with a / 1 then T^a) is an 
infimum set for a. 
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In the special case a £ Q, this result is an improvement over Theorem 1.2 which only establishes the 
existence of an infimum set for a, whereas Theorem |1.3| identifies a specific infimum set. In this article, 
we further specialize to the case where a has exactly two distinct prime factors. That is, we a ssume that 
a = p a / q b , where p and q are primes and a, b £ N. Our main results, Theorems 3.1 and 3.2 produce a 


particular infimum set arising from the continued fraction expansion for log q/ logp. In spite of the specialized 
nature of our results, we believe they have the following two advantages: 

(1) Our infimum set B(a) is smaller than a ), and hence, we improve the best known result (Theorem 


1.31 in the special case where a = p a /q b . 


(2) We establish a connection between the metric Mahler measures and the theory of continued fractions. 
We believe that ([2]) is of particular interest because it is the first known connection between the metric 
Mahler measures and a more classical area of number theory. 

The remainder of this article is structured as follows. In Section [2j we provide two key definitions - upper 
and lower best rational approximations to an irrational number £. Further, we apply a well-known technique 
(see [101 pp. 55-63]) to note these rational numbers are obtained via continued fraction expansions. In Section 
[3j we state our two main results showing that the infimum in rritipt) is attained using the aforementioned 
upper and lower best approximations to log q/ log p. In Section [4j we consider several new applications of 
our main results, we establish an estimate on the size of B(a) in certain special cases, and we pose some 
open questions arising from our work. Finally, we provide the proofs of all of our new results in Section [5] 

2. Upper and lower best approximations 

We shall write No = {0,1, 2, 3,...} and let Af = {(a, 6) G N 0 x N 0 : (a, b) ^ (0,0)}. If (a, 6), (c, d) £ J\T we 
define addition of these elements in the natural way 

(2.1) (a, b) + (c, d) = (a + c, b + d). 

Suppose that £ is a positive real irrational number and define the upper and lower sets for £ by 
U(£) = {(a, b) £ Af : a > b(} and £(£) = {(a, b) £ Af : a < b(} . 


It is clear from these definitions that both U{() and £(£) are closed under the addition defined in (2.1), and 


moreover, AT = U £(£) is a disjoint union. For the purposes of this article, we adopt the convention 
that gcd(a, 0) = a and that a/0 = 00 for all a £ N. As a result, we obtain that 

U(£) = {(a,6) £Af: ^ and £(£) = j(a,6) £ Af : ^ . 

Additionally, we write Q = {(a, b) £ Af : gcd(a, b) = 1}. Now define the subsets of Ui{() and W 2 (£) of U(tf) 
by 


Ui(£f) = | (a,b) £ld(£,) : - = min j — : (m , n) £ m < a j j 


and 


^ 2(0 = {(a, b) £ U(C) : “ = min [^ : ( m,n) £ U{£), n < &j j . 

Roughly speaking, we may think of the points in and £/ 2 (£) as best approximations to £ from above. 

Similarly, we define the analogous subsets £i(£) and £ 2 (£) of £(£) by 


A(0 = | (a, b) £ £(£) : ^ = max : (m,n) e £(£), m < ajj 


and 


£ 2(0 = {(«,&) £ ^ = max j™ : (m,n) € £(£), n < foj j . 

Analogous to the definitions of Ui {£) and W 2 (£), we interpret the points in £i(£) and £ 2 (£) as best ap¬ 
proximations to £ from below. It is straightforward to make the following observations regarding the above 
sets. 

Proposition 2.1. Suppose that £ is a positive real irrational number. 

(i) (a, b) £ U(ff) if and only if (6, a) £ £(£ -1 ) 

(ii) (a,b) £ld i(£) if and only if (6, a) G £ 2 (£ -1 ) 

(Hi) ( a,b ) £ W 2 (£) if and only if(b,a) £ £i(£ -1 ) 
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We also observe that, in general, we do not have Ui{(f) = lf 2 (f) or £i(£) = £2(0- F° r instance, if 
0 < £ < 1/2 then (1,1) £ U 2 {C) but (1,1) ^ Wi(£). However, we do obtain the following relationships among 
these sets. 


Proposition 2.2. Suppose that £ is a positive real irrational number, 
(i) U&) QU 2 {£) and £ 2 (£) C £,(,£) 

(a) if £ > l t/ien c? nWi(£) = c? n7/ 2 (£) 

(in) If f < 1 £ien Cl fl £i(£) = 1? fl £ 2 (£) 


Each element of Q n U\(ff) is called an upper best approximation for £ and each element of Q n £ 2 (£) is 
called a lower best approximation for £. As a cautionary note, our definition of upper best approximation is 
slightly more restrictive than what is normally seen in the literature (see |7,10 , for instance). Typically, the 
elements of Q l~l U 2 (f() are called the upper best approximations to £. However, in view of Proposition |2.2[ pj), 
we find our definition to be best suited to our purposes. Nevertheless, under the assumption that £ > 1, 
Proposition |2.2|pl|) shows that our definitions of upper and lower best approximations are equivalent to the 


more standard definitions. In this situation, there is a well-known method described in 10 pp. 55-63] (or 
see [7, p. 123] for a more brief discussion) for listing the upper and lower best approximations for £ which 
we summarize as a theorem. 


Theorem 2.3. Suppose that £ > 1 is an irrational number having continued fraction expansion given 
by f = [xq',Xi,X 2 ,Xs, ...] and let ( a,b ) £ Af\ {(1,0), (a^o, !)}• Then ( a,b ) is an upper or lower best 
approximation to £ if and only if gcd(a, b) = 1 and there exists n £ N and 1 < x < x n such that 
a/b = [a; 0 ;xi,x 2 , ..., x n - 2 , x n -i, x\. 


For comparison purposes, it is worth noting the connection between the above definitions and the definition 
of best approximation. An ordered pair (a, b) is called a best approximation^ to f if 

(i) gcd(a, b) = 1 

(ii) If |£ — a/b\ > |£ — r/s| for some (r, s) £ Af then s > b. 

If (a, b) is a best approximation to £ > 1 then it follows that (a, b) is either an upper best approximation or 
a lower best approximation to £. However, the converse of this statement is false. For instance, if we take 
£ = log 3/log 2 = 1.5849625 ... then it can be shown that (27,17) is an upper best approximation to £. On 
the other hand, 


27 



19 

17 

= 0.00327279.. 

and 

/- 

^ 12 


0.00162917... 


meaning that (27,17) is not a best approximation to f. Nevertheless, the upper and lower best approximations 
- not the best approximations - are the relevant definitions for our mains results. 


3. Main results 


As noted above, this article is devoted to studying the infimum sets for a 
factorization of a has the form 



p a /q b . In this situation, every 


where a = a± + a 2 + ■■■ + on and b = bi + b 2 + ■ ■ ■ + fpv- The first of our two main results considers the 
general case (a, b ) £ Af. 


Theorem 3.1. Let p and q be distinct primes and set £ = logg/logp. Assume that (a,b) £ Af and write 
a=p a /q b . If 

pa i pa2 paN \ 

q bl ’ q b2 ’ ’" ’ q bN ) 

is a factorization of a which attains the infimum in m t (a) for some t > 1 then, for every 1 < n < N, 
( a ni b n ) e f?nW 2 (() or ( a n ,b n ) £ £n£i(£)- 


1 A rational number satisfying this definition is often called best approximation to § of the first kind. Indeed, there is a commonly 
used definition for best approximation of the second kind which is more restrictive. However, we believe the best approximation 
of the first kind is the proper analog of upper and lower best approximations as defined in this paper. Hence, we only mention 
its definition here. 
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Unfortunately, we cannot conclude from Theorem 3.1 that (a„, b n ) is an upper or lower best approximation 
to £. That is, we cannot conclude that ( a n ,b n ) £ Q n£/i(£) or that ( a n ,b n ) £ £?n£2(0- Indeed, consider 
for instance a = 8/25 = 2 3 /5 2 so that £ = log 5/ log 2. In this case, it can be shown that 

(3-1) = 


attains the infimum in mt(a) for all t > 1. However, we have that 


1 2 log 5 

- < - < 

1 1 log 2 


meaning that (1,1) ^ Nevertheless, we are able to produce the desired stronger conclusion under the 

additional assumption that (a, b) is an upper or lower best approximation to {. 

Theorem 3.2. Suppose p and q are distinct primes and { = log q/ logp. Assume that (a, b) is an upper or 
lower best approximation for £ and set a = p a /q b . If 


p ai p a2 
q b i 1 q b 2 ' 


7^ TV 


is a factorization which attains the infimum in mtfa) for some t > 1 then, for every 1 < n < TV, (a n , b n ) is 
an upper or lower best approximation for £. 


It follows from the work of [1] and 11 that the trivial factorization (a) attains the infimum in nit(cx) for 
all t < 1 provided that a £ Q. Consequently, Theorem |3.2| allows us to determine an infimum set for a using 
the upper and lower best rational approximations for £. We define 


' qrfl 1 1 nrS^N 

B(a) = r. ■ ■ • . ^7 ) £ H<*) ■ K, b n ) £ g n (Wi(0 u c 2 (0) 


(3.2) 

’ ' l V 9 Dl <T 

so that B(a) is the set of all factorizations of a which use only upper best approximations or lower best 
approximations for £. 

Corollary 3.3. Suppose p and q are distinct primes and £ = logg/logp. If ( a,b ) is an upper or lower best 
approximation for t; then B(p a /q b ) is an infimum set for p a /q b . 


All of the pairs ( a n ,b n ) appearing in (3.2) may be determined by examining the continued fraction ex¬ 
pansion for \ogq/ logp in accordance with Theorem 2.3 This technique will be common practice throughout 
the applications to be discussed in the next section. 


4. Examples, Applications and Open Questions 

Let a be an arbitrary algebraic number and suppose that A = (a\,a 2 , ■ ■ ■, cxn) and B = (/3i,/3 2 , ..., /?m) 
are product representations of a. We say that A is equivalent to B if TV = M and there exists a bijection a : 
{1,2,..., TV} —► {1,2,..., TV} such that a n = S„( n ) for all 1 < n < TV. In this case, we write A ~ B and note 
that ~ defines an equivalence relation on the set of all product representations of a. If A = (an, a 2 , • • • , a^v) 
is a product representation of a , we define the measure function of A to be the function /a ■ (0, oo) —> [0, oo) 
given by 

l/t 


/a(0 = 


If A ~ B, it is obvious that f a = /s- 


4.1. The Characteristic Transformation of a. In order to apply the results of Section [3j it will be 
helpful to express the elements of B(a) as vectors. In this way, we will be able to convert the question of 
determining B(a) into a linear algebra question. For simplicity, we shall now write B(a) to denote the set 


of equivalence classes of points in (3.2). Suppose that p and q are primes with q > p and set £ = log qj log p. 
Further suppose that (a, b) is an upper or lower best approximation for £ and set a =p a /q b . 


Let 


(4.1) 


(ai, 6i), (a2,62),..., (a/v, fr/v) 
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T n = 


be the complete list of upper or lower best approximations to £ with a n < a and b n < b. Moreover, assume 
that b\ < b 2 < • • • < b^ and if b n = 6„+i then a n < a n + 1 . We note that the points listed in (4.11 depend 
only on a. The characteristic transformation of a is defined to be the linear transformation T a : M. N —>■ M 2 
given by 

«1 02 ' ' ' O’N 
bi b 2 • • • bN 

It is easily verified that the rows of T a are linearly independent over R, which implies that T a is a surjection 
and dimR(kerT a ) = N — 2. A factorization vector for a is a point x = (xi,x 2 , ..., Xn) T £ R w satisfying 

(i) T a (x) = (a, b) T 

(ii) x n £ No for all 1 < n < N. 

We let V(a) denote the set of all factorization vectors for a, and for each x € V(a), define the factorization 
associated to x by 

( \ 


(4.2) 


A x = 


7 bi ’ 


p ai p a2 

S, 


7^2 ’ 


761 V 




V xi times x 2 times x N times 

The following observation allows us to view elements of B(a ) as factorization vectors. 


B(a) given by 


Theorem 4.1. If a satisfies the hypotheses of Theorem 3.2 then the map <j> : V(a) 

4>fx) = A x is a bijection. 

If S C V(a) is such that (f>{S) is an infimum set for a, then we simply say that S is an infimum set for 
a. In this language, Corollary 3.3 asserts that V(a) is indeed an infimum set for a. For x £ V(a) we write 
/x = /^(x) and we say that / x is the measure function of x. If x = (xi,... ,Xn) t then 

l/t 


(4.3) 


/*(*)= 


and it follows from Theorem 4.1 and Corollary |3.3| that mt{a) = min{/ x (t) : x £ V(a)}. 


T n = 


Example 4.2. Consider a = 32/27 = 2 5 /3 3 . We shall attempt to sketch the graph of t 1 —>■ mt(a). In this 
case, we have that £ = log 3/ log 2 so that the beginning of the continued fraction expansion for £ is given by 

i= [1; 1,1,2, 2 ,3,1,5, 2,23,2, 2 ,...]. 

In view of Theorem |2.3| the first several upper and lower best approximations to £ are 

{(1,0), (1,1), (2,1), (3,2), (5,3), ( 8 ,5), (11, 7), (19,12),.. 

and therefore, the characteristic transformation of a is given by 

112 3 5 
s 0 1 1 2 3 

By solving the system T a (x) = (5, 3) T for x having non-negative integer entries, we now obtain 


V{a) = 


According to Corollary |3.3| this set forms an infimum set for a. In order to sketch the graph of 1 1 — > nit(a) we 
recall that m t (a ) = min{/ x (f) : x £ V(a)} so it will be useful to obtain graphs for f x (t) for each x £ V(ct). 
Setting x = (xi, x 2l £ 3 , £ 4 , x$) T and applying (4.31 we deduce that 

1 It 

( V / 2 J \ ( 2 J 

X3IJI 


■ 



fo\ 


(A 


(o\ 


(A 


(A 



0 


0 


1 


1 


2 


3 



0 

5 

1 

5 

0 

1 

2 

5 

1 

1 

0 



0 


1 


1 


0 


0 


0 



V) 


W 


w 


w 


W 


VV 

> 


/xW = I Xim ( ^ 


+ x 2 m | ^ 3 - 


2 2 y / 2 3 

51 ) 


+ x 5 m — 
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and hence, 


f x (t) = (a;i(log 2) 4 + £2 (log3 ) 4 + £ 3 (log4)* + 24 (log9 ) 4 + £5 (log32)*) /l 
The graphs of the measure functions / x (t), for each x £ V(cr), may be found in Figure [l] By examining the 
curves in Figure [I] we note that the set 


S(a ) = < 


is also an infimum set for a. The graphs of the measure functions of the points in S may be found in Figure 

m 

Following the definitions developed by the author in [12| , we say that a positive real number t is standard 
for a if there exists an open neighborhood U of t and (yi, 2 / 2 , • ■ •, y n ) £ R w such that 



fo\ 


M 


(°\ 


(A 



0 


0 


1 


3 



0 


1 

1 

2 

5 

0 



0 


1 


0 


0 


< 

v) 


w 


w 


W 

> 


m t (a) = (| j/i I* + 1 2 / 2 1 


\Vn 


,i/t 


for all t £ U. If t is not standard for a then we say that t is exceptional for a. We may think of exceptional 
points as those points at which the infimum attaining factorization of a is forced to change. The work of 12 
provided an example of a rational number having two exceptional pointjrl however, no example having more 
than two exceptional points was known. By examining Figures [l] and [2j the example 32/27 presented here 
clearly has three exceptional points and is the first such known example. 

The method outlined in Example |4.2| may be applied to other cases in order to produce examples of 
rational numbers having more than three exceptional points. To obtain an example with four exceptional 


points, we may make a simple adjustment to Example 4.2 


Example 4.3. Consider a = 256/243 = 2 s /3 5 . As we are using the same primes as in Example 
obtain the same list of upper and lower best approximations to £ = log 3/log 2 

{(1,0), (1,1), (2,1), (3,2), (5,3), (8,5), (11, 7), (19,12),...}, 

and therefore, the characteristic transformation of a is given by 


4.2 


we 


T n = 


Once again we solve the system T a (x) = (8, 5) T for x having non-negative integer entries to obtain 


V(a) = < 


M 
0 
0 
0 


0 

V 1 / 


/o\ 

0 

0 

1 

1 

V°7 


M 

1 

1 

0 


M 

0 

1 

2 


1 0 

Vo/ W 

Setting x = (x 3 , X 2 , £ 3 , x 4 , 25 , Xq) t and applying (4.3) we deduce that 


A\ 

2 

0 

0 

1 

V«/ 


/o\ 

1 

2 
1 
0 

Vo/ 


/i\ 

1 
0 

2 
0 

Vo/ 


/i\ 

2 

1 

1 

0 

Vo/ 


M 

2 

3 
0 
0 

Vo/ 


/ 2 \ 

3 

0 

1 

0 

Vo/ 


A\ 

3 

2 

0 

0 

Vo/ 


fx{t) = rn 


X2in 


+ x 3 m 


■ X 4 m 


■ x 3 m 


/ 2 \ 

4 

1 

0 

0 

Vo/ 


XqTTI 


/3\1 

5 
0 
0 
0 

Vo/ J 



and hence, 

/x(f) = (24 (log 2)* + £ 2 (log 3)* + x 3 (log4)‘ + * 4 (log9)‘ + a;5(log32)* + x 6 (log256) t ) 1/t 

As in our previous example, we plot the graphs of the measure functions f x (t) for each x £ V(a). These 
graphs may be found in Figure [3j In this case, we determine that the set 


This result of [l2| relied on a conjecture which was shown in |6j to be correct. 
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is also an infimum set for a and the graphs of the these measure functions are given in Figure [4j In this 
example, we observe that a has exactly four exceptional points. 


4.2. Golden Ratio Approximations. The collection of upper and lower best approximations to £ are 
particularly simple to identify when £ is close to <j) = (1 + \/5)/2. To see this, we let {h n }'^L 0 be the 
Fibonacci sequence, i.e., ho = 0, hi = 1 and h n = h n - 1 + /i ra -2 for all n > 2. It is well-known that 


h2 h± he 
^ h 3 h 5 


1 + a/5 h 5 h 3 hi 

'' < —^— <•••< — < — < — 

2 /I 4 tl2 ho 


and that {h n /h n -i}^ = i are precisely the list of convergents to </>. We now note the following important 
observation. 


Theorem 4.4. Suppose n is a positive even integer, 
(i) There exist primes p and q such that 


h n log q h n+ 1 

h n -1 log p h n 

(ii) Suppose p and q are primes satisfying |I]). If a =p h n+i/q h n then the characteristic transformation of 
a is given by 

rp _( hi /12 • • • h n +i \ 

la ~\h 0 hi ■■■ h n ) ' 

In the previous subsection we studied an example satisfying Theorem |4.4[ p]) (Example |4.2| ). Indeed, taking 
n = 4 we find that p = 2 and q = 3 satisfy the required property that 

3 log 3 5 

- < < -. 

2 log 2 3 

As a result, we obtained the characteristic transformation 


/ 1 1 2 3 5 \ 

^ 0 1 1 2 3 ) ' 


Moreover, we note that h n /h n -1 > log 3/log 2 for all n > 4. Consequently, 4 is the largest value of n for 


which the pair (p, q) = (2, 3) satisfies Theorem 4.4[i|. We now consider an example with n = 8. 


Example 4.5. Set n = 8 so that h n /h n -i = 21/13 and h n +i/h n = 34/21. Note that p = 31 and q = 257 
are primes which satisfy the required inequalities 


21 ^ log 257 ^ 34 
13 < log 31 < 21' 


If we set a = 31 34 / 25 7 21 then Theorem 4.4 |n| asserts that a has characteristic transformation given by 


T a = 


1 1 2 3 5 8 13 21 34 

0 1 1 2 3 5 8 13 21 

























After determining the set V(a) we are able to find that 


S(a) = < 



M 




(0\ 


(0\ 


0\ 


(0\ 


0\ 


/m 



0 


0 


0 


0 


0 


0 


8 


21 



0 


0 


0 


0 


0 


5 


13 


0 



0 


0 


0 


0 


3 


8 


0 


0 



0 

1 

0 

1 

0 

1 

2 

5 

5 

5 

0 

1 

0 

5 

0 



0 


0 


1 


3 


0 


0 


0 


0 



0 


1 


2 


0 


0 


0 


0 


0 



0 


1 


0 


0 


0 


0 


0 


0 



\1 




\o) 


\oj 


\o 


\o) 


l 0 ) 


0 



is an infimum set for a. In this example, a has seven exceptional points. 


Examples 4.2 and 4.5 suggest a conjecture regarding primes satisfying Theorem |4.4|ji[) . 


Conjecture 4.6. If p and q are primes satisfying 

h 


hn—l 


logg K +1 

^ log p h n 


and a = p h ™+i/q h ™ then 



fo\ 




( ° ^ 


^ h n — 



0 


0 


hn—2 


hn 



0 


0 


h n —1 


0 



0 

5 

0 

5 * * * J 

0 

1 

0 



0 


0 


0 


0 



0 


hi 


0 


0 



ho 


h 2 


0 


0 


< 

\hi) 


\o ) 


V o J 


V o / 

> 


is an infimum set for a. Moreover, a has n — 1 exceptional points. 

Conjecture |4.6| is of particular interest in view of the author’s work 13,. Indeed, Corollary 2.3 of 13 asserts 
that every algebraic number has only finitely many exceptional points. However, it is currently unkn own 
whether there is a uniform upper bound on the number of exceptional points for a £ Q. If Conjecture 4 
holds then this question is resolved in the negative, i.e., no such uniform upper bound exists. 

4.3. The size of V(a). In the introduction, we noted that our set V(a) (or equivalently 13(a)) is smaller 
than the infimum set J-(a) of all factorizations of a described by Theorem 1.3 In the case where a arises 


from Theorem |4.4| we exhibit this discrepancy by providing specific estimates on the sizes of these sets. 

We first provide a general estimate on the size of 3F(a) in the case where a = p x /q y for some x 1 y £ No- Let 
j( x) and j(y) denote the number of partitions of x and y , respectively, so it follows that #T(a) > j(x)j(y). 
A classical result of Hardy and Ramanujan [5] provides an asymptotic formula for j(x) which we may apply 
here to obtain log(#J 7 (a)) \fx + y/y. Letting fl(a) denote the total number of not necessarily distinct 
prime factors of a, we deduce that 


(4.4) 


k>g(#.F(a)) > H(a) 1/2 . 


If a is chosen to satisfy the hypotheses of Theorem |4.4lfii|) then we are able to provide an upper bound on 


V(a) which is significantly smaller than the right hand side of (4.4). 


Theorem 4.7. For each even natural number n, let p n and q n be primes such that 

K < log gn h n+ 1 


hn -1 log Pn h 

and set a n = P n" +1 /?£”• Then log(#V(a„)) < (logh„+i) 2 . 
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For purposes of comparison with (4.4), Theorem 4.7 gives 
(4.5) log(#V(a n )) < (logH(a„)) 2 

verifying that the infimum set V(a„) introduced in this article is considerably smaller than the previously 
best known infimum set J(q„). We also recall that Conjecture 4.6 proposes the existence of an infimum set 
5(a) having exactly n elements and as a result, we know that log(#5(a n )) <C loglogn(an). Therefore, if 
Conjecture 4.6 holds, then we obtain an infimum set satisfying an even stronger condition than (4.5). 


4.1 


4.5 


4.4. Minimal Infimum Sets. We recall from Corollary |3.3| and Theorem 
an infimum set for a. However, as is clear from Examples |4.2[ |4.3| and 
smaller set which is also an infimum set. These observations motivate the following definitions. 

An infimum set X(a) is called minimal if there does not exist a proper subset of X(a) which is also an 


that the set V(a) always forms 
there is often a considerably 


infimum set for a. Using the work of the author 13 , it is possible to show that every algebraic number has 
a minimal infimum set. Moreover, if X and y are minimal infimum sets for a then there exists a bijection 
a : X —> y such that / x = f a {x)- In particular, f/X = ffy, and we call this value the minimality index of a. 

are 


It is straightforward to verify that the sets 5(32/27) and 5(256/243) given in Examples 4.2 and 4.3 


minimal infimum sets for 32/27 and 256/243, respectively. However, there is currently no known algorithm 
for determining a minimal infimum set for a in general. By studying the convex hull of V(a), we are able to 
provide a partial result in this direction. 

If 5 C is finite then the convex hull of S is defined to be 


Conv(5) = < ^ < 
Ixes 


> 0 and ^ c x = 1 
xG5 , 


Note that if S contains exactly two points, then Conv(5') is simply the line segment connecting those points. 
If x £ Conv(5) but x ^ Conv(5 \ {x}) then x is called a vertex of Conv(5). We let Vert(5) denote the set 
of all vertices of Conv(5) and note that Vert(5) C 5. 

Theorem 4.8. Suppose p and q are distinct primes and f = logg/logp. Further assume that ( a,b ) is an 
upper or lower best approximation for £ and a = p a /q b . If S C V(a) is an infimum set for a then Vert(5) 
is also an infimum set for a. 


Examining our work from Example 4.2 we set a = 32/27 and recall that 


V(a) = < 


' 



(0\ 


A\ 




( l \ 


( 2 \ 

' 


0 


0 


i 


1 


2 


3 



0 


1 

? 

0 

? 

2 


1 

1 

0 



0 


1 


i 


0 


0 


0 



V) 


VV 


VV 


W 


w 


w 

> 


We observe that 


/ 1 \ 

2 

1 

0 

VO / 


( 0 \ 

1 

2 

0 

VO/ 


/2\ 

3 

0 

0 

VO/ 


proving that (1, 2,1,0, 0) T is not a vertex of V(a), and hence, removing this vector from V(a) still results in 
an infimum set for a. In fact, Theorem 14.81 reveals that 


Vert (V (a)) = 


' 



M 


n\ 


M 



' 


0 


0 


i 


i 


3 



0 

1 

1 

5 

0 

5 

2 

5 

0 



0 


1 


1 


0 


0 


< 

V) 


w 


VV 


W 


w 

> 


is an infimum set for a. Unfortunately, (1,1,0,1, 0) J is a vertex of V(a), so Theorem 4.8 does not explain 

It remains open 


why this vector does not belong to the minimal infimum set 5(32/27) from Example 4.2 
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to determine a minimal infimum set for any rational number a of form a = p a /q b , where (a, b ) is an upper 
or lower best approximation for logg/logp. 


5. Proofs 


The proof of Proposition |2.1| is sufficiently straightforward that we need not include it here. Hence, we 
proceed with the proof of Proposition |2.2| 


Proof of Theorem\2.2\ To prove 0, it suffices to prove that U\{£) C ZY 2 (£) as the other set containment 


follows from Proposition 2.1 Suppose that (a, b) £ U\(£) but (a, b) £ W 2 (£). Therefore, we have that 


a 

— = mm 
b 


(nn 1 a (Tfi 1 

in \ — : (to, n) £ U{f), m < a\ and — > min \ — : (to, n) £ t/(£), n < b > , 

In J bln ) 


where the inequality follows from the fact that (a, b) £ ZY(£). Therefore, there exists (to, n) such that 

(5.1) y > — > £ and n <b. 

b n 

By our assumption that (a, b) £ we must have that m > a. We cannot have n = 0 because then the 

inequalities would fail, so it follows that m/n > a/n > a/b, a contradiction. 

By Proposition [2TTJ it is enough to prove |u| in order to complete our proof. So we assume that £ > 1 and 
that (a, b) £ <7nW 2 (£). If b = 0 then our assumption that (a, b) £ Q forces a = 1 and clearly (1,0) £ Z£l(£). We 
may assume without loss of generality that a > 0 and b > 0. In this situation, our assumption (a, b ) £ W 2 (£) 
implies that 

(5.2) 


Now suppose that ( m,n ) £ U{£) is such that 


, to a 

4 < - < T- 
n b 


Since we have assumed that (a,b) £ W 2 (£), we must have that n > b. In order to prove that (a, b) £ 
we must prove that m > a which we shall do by contradiction. Hence, we assume that m < a. 

Since a — 1 > 0 and n > b we obtain that 

to , a—1 a—1 
> -, 

n bn 

and therefore m > a — 1. Since m £ N, we now find that a = to. Consequently, 

a 


m a 

n n ~ b 


a 

1 < b 


and hence, (a — l)/b < a/(b + 1). Simplifying this inequality, we are lead to a < b + 1 so that a < b 
contradicting our assumption that £ > 1 and (5.2). □ 


5.1. Results connected to Theorem 


3.1 


A point x £ U(ff) is called reducible with respect to £ if there 


exist y,z £ U(0 such that x = y + z. Similarly, we say that a point x £ £(£) is reducible with respect to £ 
if there exist y, z £ £(£) such that x = y + z. If x £ J\T is not reducible with respect to £, then we say that 
x is irreducible with respect to £. By applying Proposition 2.1 we find that (a, b) is irreducible with respect 
to £ if and only if (6, a) is irreducible with respect to £ -1 . The first step in our proof of Theorem 3.1 is the 
following lemma. 

Lemma 5.1. Suppose that £ is a positive real irrational number and (a, b) £ A f. 

(i) Assume ( a,b ) £W(£). Then ( a,b ) is irreducible with respect to £ if and only if (a,b) £ 5nW 2 (£). 

(ii) Assume (a, b) £ £(£). Then (a, b) is irreducible with respect to £ if and only if (a, b) £ Q D £i(£)- 


Proof. In view of Proposition 2.1 it is sufficient to prove (m) . We begin by assuming that (a, b) is irreducible. 


If (a, b) fL S then we may set gcd(a, b) = d > 1, and it follows that 

' a{d — 1) b(d — 1) 


(5.3) 


“is-5 


Clearly both summands on the right hand side of (5.3) belongs to W(£) so that (a, b) is reducible, a contra¬ 
diction. 


n 















We now assume that (a, b) ^ ZY 2 (£), so there exists ( m,n ) £ 6 /(£) with n < b such that 
(5.4) 


a m 
b > n' 


If we had that m > a then m/n < a/b < a/n < m/n, so we must have that m < a. We now attempt to 
show that (a — m,b — n) GlA (£). 

We cannot have n = 0 because this contradicts (5.41. Additionally, if n = b then (to, b) £ and the 
equality (a, b) = (a — to, 0) + (to, b) contradicts our assumption that (a, b) is irreducible. Consequently, we 
may assume that 0 < n < b. Since ( m,n ) £ 6/(£), we know that m/n > £, and since b — n > 0, we may 
multiply both sides of this inequality by b — n. Now we obtain that 


which leads to 


777 

— (b — n) > £(6 — n ) 
n 


mb 

-to > £(o — n) 

n 


Now using (5.4) and the fact that b > 0, we obtain that 

ab 


—— to > £(6 — n). 


Hence, we have shown that (a — m, b — n) £ W(£). Therefore, (a, 6 ) = (a — to, 6 — n) + (to, n) satisfies the 
requirement to be reducible, a contradiction. 

Next, we assume that (a, b) £ C- nZ^ 2 (C)- If 6 = 0 then since gcd(a, 5) = 1, we know that a = 1. Clearly 
(1, 0) is irreducible, so we may assume that 6^0. Suppose now that (a, b) is reducible. Therefore, there 
exist (ai,&i), ( 02 , 62 ) £ U{£/) such that 


(5.5) 


a = 01 + 02 and 6 = 61 + 62 . 


We cannot have that 01 = 0 or 02 = 0 because (0, k) li(£/) for any k £ No- Moreover, we cannot have 
6 -| = 0 because then 6 2 ^ 0 and 

CL CL\ + Q-2 O2 

6 62 > 62 

contradicting our assumption that (a, 6 ) £ 6/2 (£)• Similarly, we cannot have that 62 = 0. In view of these 
remarks, we may now assume that none of or, 02 , 61,62 are equal to 0 . 

We observe that 61 , 6 2 < 6 , and since (a, 6 ) £ £/ 2 (£)> we obtain that 


a or 


Applying (5.5), we deduce that 


61 


Oi + 0 2 Oi 


61 + 6 2 61 

Simplifying these inequalities, we are lead to 

0261 < oi6 2 


and 


0 02 

6 — 62 


and 


«i + a 2 a 2 
61 + 6 2 62 


and 0162 < a 2 6 i, 


and therefore 

01 _ a 2 
61 6 2 ' 

Now assume that k,£ £ N are such that gcd(fc,£) = 1 and 

k Oi a 2 

7 = h = h' 

Hence, there exist Ci, c 2 £ N such that oi = cifc, 61 = c\£, a 2 = c 2 /c and 62 
that 

a = k(ci + c 2 ) and 6 = K(ci + c 2 ) 
contradicting our assumption that gcd(a, 6 ) = 1 . 


c 2 ^, and it follows from (5.5) 


□ 
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Suppose a is a positive rational different than 1 and 


a = e (Q x r 

\Sl S 2 SN J 

is a factorization of a. We say that A is biased if either 

r n > s n for all 1 < n < N or r n < s n for all 1 < n < N. 


In the former case, we say that A is numerator biased , and in the latter case, we say that A is denominator 
biased. 

Now suppose that a = p a /q b , where (a, 6) G A f and p and q are distinct primes. In this case, every 
factorization of a must have the form 

_ / p ai p a2 p aN \ 

- 5 

where a = ai H- + On and b = b\ + ■ ■ ■ + bn- The biased factorizations of p a /q b are closely connected to 

the sets Q , W 2 (£) and £i(£). 

Lemma 5.2. Let p and q be distinct primes, and set £ = logg/logp. If ( a,b ) G £Y(£) then the following 
conditions are equivalent. 

(i) p a /q b has a no non-trivial biased factorization. 

(ii) ( a,b ) is irreducible with respect to f. 

(in) (a, b) G Q n U 2 (Q. 

Similarly, if (a,b) G £(£) then the following conditions are equivalent. 

(i) p a /q b has a no non-trivial biased factorization. 

(ii) ( a,b ) is irreducible with respect to £. 

(in) (a, b) eG n £i(C)- 


Proof. In view of Proposition 2.1 it is sufficient to prove the first statement of the lemma. Moreover, it 
follows from Lemma [5.1| that ([jij) <*=> ( pu| . Hence, we need only show that § <=» ©• 

If we assume that (a, b) is reducible with respect to £, there exist (ai, for), (a 2 , b 2 ) G ld{f) such that 
(a, b) = (ai, bi) + (a 2 , 6 2 ). Since (oj, bi) G £/(£), we know that > log g/ logp for i G {1, 2} which implies 
that 

and 


p ai > q bl 


p a2 > q b2 


Hence, (p ai /q bl ,p a2 /q b2 ) is a non-trivial biased factorization of p a /q b . 

If we assume that p a /q b has a non-trivial biased factorization (p ai /q bl ,p a2 /q b2 ) then certainly ai +a 2 = a 
and bi + b 2 = a. We also have that 

p ai > q a2 and p a2 > q b2 . 


which means that (aj,6j) G W(£) implying that ( a,b ) is reducible. 


□ 


In order to complete the proof of Theorem |3.1[ we must establish a connection between non-trivial biased 
factorizations of p a /q b and the infimum in mt(a). 

Lemma 5.3. Suppose that a is a positive rational number, A = (ri/si,..., rjv/sjv) « factorization of a, 
and t G (1, oo). If there exists n such that r n /s n has a non-trivial biased factorization, then A cannot attain 
the infimum in mt{cx). 

Proof. Suppose that r n /s n has a non-trivial biased factorization and assume without loss of generality that 
r n > s n . Therefore, there exist w, x, y, z G N such that w > x, y > z and 

Tn _W y 
S n X z ’ 

where gcd (w,x) = gcd (w,z) = gcd (y,x) = gcd (y,z) = 1. It follows that r n = wy and s n = xz. These facts 
combine to ensure that 

TO ^7“^ = l°g r n = log W + logy = TO ^ — ^ + TO (-'j 
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Since t > 1 we now obtain that 


H!) +ra (!))' >m (3' +m (!)'- 


Moreover, we clearly have that 


N 

n 

, m= 1 
\m^n 


w _ y 
x z 


so that A cannot attain the infimum in the definition of 

Theorem |3.1| now follows easily by applying Lemmas |5.2| and |5.3 


□ 


5.2. Results Connected to Theorem |3.2| . We now wish to complete the proof of Theorem 3_2 For this 
purpose, we need three additional definitions and some lemmas. Assume that £ is a positive real irrational 
number. A point (a, b) £ AT is called an upper boundary point for £ if (a, b) £ W(0 and (a, 6 + 1) £ £(£). 
Similarly, (a, b) is called a lower boundary point for £ if (a, b) £ Off) and (a + 1,6) £ U(ff). In either case, 
(a, b) will simply be called a boundary point. 

Lemma 5.4. Assume that £ is a positive real irrational number and (a, b) £ Q. Then the following conditions 
hold. 

(i) Assume (a, b ) £ U 2 (£). Then (a, b ) is an upper boundary point for £ if and only if (a, b) £ ££(£) 

(ii) Assume (a, b) £ £i(£). Then (a, b) is a lower boundary point for £ if and only if (a, b ) £ £ 2(0 ■ 


Proof. By Proposition |2.1| it is sufficient to prove either one of the assertions in the lemma. We shall prove 
|u]). In this case, assume that (a, b) £ £i(£). 

First suppose that (a, b) is a lower boundary point for £ and that (m,n) £ £(£) satisfies 

am 

£< — <€■ 
b n 

Since (a, b ) £ £i(£), we know that a < m. In order to establish that (a, b) £ £ 2(^)1 we must prove that b < n 
which we shall do by contradiction. Hence, assume that b > n. 

If a = 0 then since gcd(a, b) = 1 we obtain that (a, b) = (0,1) so that n = 1. This means that 

1 m 

< <£ 


a 

b < 1 


which contradicts our assumption that (a, b ) is a lower boundary point for £. 
Therefore, we now assume that a > 0. If b > n then 

a a m 

T < - < — < £ 

b n n 

so that (a, n) £ £(a) contradicting our assumption that (a, b) £ £i(£). If b = n then 

a m m 

t < -r = — < «• 

b b n 


But since a < a + 1 < to, we obtain 


a a +1 m 
- < —— < - < a 
b b n 


contradicting our assumption that (a, b) is a lower boundary point for £. This proves that b < n so it follows 
that (a, b) £ £ 2 (£). 

For the converse, if (a, b ) is a not lower boundary point for £ then 

a T 1 


a 


<e 


contradicting our assumption that (a, b) £ £ 2 (£)- 

We obtain an important lemma regarding boundary points in optimal factorizations. 


□ 


14 






Lemma 5.5. Suppose p and q are distinct primes and £ = logg/ logp. Assume that ( a,b ) is a boundary 
point for £ and that 


(5.6) 


/p°l p a 2 

VgN ’ gfe’ 


p a N 

q b N 


attains the infimum in mt(p a /q b ) for some t > 0. 


Then ( a n ,b n ) is a boundary point for all n. 


Proof. Assume without loss of generality that (a, b) £ £(£) so that (a, b) is a lower boundary point for £. 
This means that (a, b) £ £(£) and (a +1,5) £ W(£). Now assume that there exists n such that (a n , b n ) is not 
a boundary point and we distinguish two cases. 

We first assume that ( a n ,b n ) £ £(£). Since ( a n ,b n ) is not a lower boundary point, we know that (a n + 
1, b n ) £ £(£). As a result we have that 

(5.7) m ( —t ) = m 




If (a m , bm) £ £(£) for all 1 < m < N then we would have that 

/ 


n a+1 


N 

i+ 

, m— 1 
\m^n 


Q n 1 


9° 


< 1 


contradicting our assumption that (a + 1,6) £ LA ((f). Therefore, we assume that there exists k such that 
p ak /q bk > 1 so that 

— 1 ' 

(5.8) m ( ) > m 

However, we clearly have that 






r,bk 


q° 


N 

n 

m= 1 


rpQ‘n ~\-1 pO'k 




7 frfc 


Combining this with (5.7) and (5.8), we contradict our assumption that (5.6) attains the infimum in m* (p a /q b ) 
for some t > 0. 

Next, we assume that ( a n ,b n ) £ lA((f) so that ( a n ,b n ) is not an upper boundary point. This means that 
(a n , b n + 1) G U(£) and 


TO 


p u. 


= TO 


pU. 


We cannot have that p am /q bm > 1 for all to because then 

- ^ pQm 


P 


n 

m=1 


q u 


7^n + l 


> 1 


contradicting our assumption that (a, 6) £ £(£). Therefore, there exists k such tat p ak /q bk < 1 and it follows 
that 


TO 


pu. 




> TO 


pu. 


7 bfc-l 


once again contradicting our assumption that (5.6) attains the infimum in mt(p a /q b ) for some t > 0. □ 

We may now quickly record the proof of Theorem |3.2| as a consequence of Theorem |3.1| and Lemma |5.4| 


Proof of Theorem \3.2\ Ass ume without loss of generality that (a, b ) is an upper best approximate for £ and 
set a =p a /q b . Lemma 5.4 implies that (o,6) must be an upper boundary point for £. Further suppose that 


p °l pO.2 


q bl ’ q b2 q‘ 


p 1 
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3.1 

5.4 

□ 


is a factorization attaining the infknum in mt(a) for some t > 1. If ( a n ,b n ) £ ZI(£) then Theorem 
implies ( a n ,b n ) £ Q nll2(£)- By Lemma [5. 5 1 ( a n ,b n ) is an upper boundary point, and then by Lemma 
( a n ,b n ) £ Q n£/i(£). If ( a n ,b n ) £ £(£) we similarly obtain that ( a n ,b n ) £ Q n£ 2 (£) as required. 

5.3. Results connected to a pplications of our main results. Our ability to rewrite factorizations of 
p a /q b as vectors (Theorem 4.1) is a critical component of our applications discussed above. We begin this 
subsection by providing its proof. 


Proof of Theorem \4-l\ Suppose that {(a n , 5 n )}^i are the upper and lower best approximations of log < 7 / logp 
wit h b\ < 62 < ■ ■ Also, we may assume that a\ < 02 < .... Since a satisfies the hypotheses of Theorem 
there exists N gN such that a = p aN / q bN and hence, the characteristic transformation of a is given by 


3.2 


T = 

1 rv 


0-1 

bi 


02 

b2 


on 

b N 


Now assume that A £ B(a). Since A is factorization of a , all entries of a must have the form p a ^/q b - 
1 < n < N, and therefore, A is equivalent to the factorization 

( 


for 


\ 


P 1X1 


-jb‘2 


7&2 


V 


xx times x 2 times 
for some aq, X 2 , ■ ■ ■, a ;at £ Nq. Moreover, we note that 


N 

E 




and 


N 

E 


7 b IV 


7 bN 


x N times 


%nbn — N 


n—1 


so that (xi,X 2 , ■ ■ ■ ,Xn) t £ V(a) and it follows that cj> is surjective. Finally, the injectivity of <f> is easily 
verified completing the proof. □ 

Before proceeding with the proof of Theorem |4.4| we note an additional fact regarding the continued 
fraction expansion of an irrational number £ > 1. For simplicity, if (a, b) is an upper (or lower) best approx¬ 
imation to £, we shall say that a/b is an upper (or lower) best approximation to £. If £ = [:ro; aq, £ 2 , £ 3 ,...] 
then we have already noted in Theorem 2.3 that a/b £ Q is an upper or lower best approximation to £ if 


and only if there exists n £ N and 1 < x < x n such that 

— [a?o, X\ , X 2 , ■ ■ *, x n —i , x]. 

If n is even, then we observe two additional useful facts: 

(i) [xq; X\, X 2 , ■ ■ ■, x n -±,x\ is strictly increasing as a function of x. 

(ii) [x 0 ; xi,x 2 , ■ ■ ■ ,x n -i,x n \ £ £(£) 

Still under the assumption that n is even, we conclude that [xo; aq, aq, • • • ,x n _i,x] is a lower best approx¬ 
imation to f for all 1 < x < x n . Analogously, if n is odd then [xq; xi, X2,..., x n _i, x] is an upper best 


approximation to f for all 1 < x < x n . We are now prepared to provide our proof of Theorem 4.4 


Proof of Theorem \4-4\ Let x = h n /h n ~i and x + s = h n+ i/h n so that £ > 0. Now select p so that p E > 2. 
It is a well-known result of Chebyshev that there exists a prime q such that p°° < q < 2 p x < p x+e , and it 
follows that x < log q/ logp < x + e proving JI]) jj 

Now let £ = log q/ logp and let 4> = (l + va)/2. We first claim that h n /h n - 1 is a lower best approximation 
to f. To see this, we assume that a/b £ Q is such that 

h n a 

< — < f and b > h n -\. 


Moreover, from ([I] 
(5.9) 


hn— 1 

we deduce that either 

n n -1 


m+1 


or 


l n— 1 




m+1 


We thank Paul Fili for providing the proof of Theorem 4.1 JT|). 
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In both cases we conclude that 


hri — l 


> 


contradicting the known fact that h n /h n -x is a best approximation to <f>. 

If n = 2 then h n /h n -x = 1 and h n+ x/h n = 3/2. In this situation, the continued fraction expansion for £ 


must have 1 as its initial entry and the desired result follows from Theorem 2.3 In every other case, there 
are precisely two continued fraction expansions for h n /h n -\ given by 

y ^ = [l;l,l,...,l,l] = [l;l,l,...,l,l,2]. 

h n -1 "-v-' '-v-' 

n— i times n- 3 times 

Since h n /h n -\ is a lower best approximation to £, Theorem |2.3| implies that the continued fraction expansion 
for £ has the form 

£ — [1;I;I>*--:1)I> dn— 2 , An- 1 j d n . . .] Or £ [1,1,1,.. .,1,1, d n — 3 ? dn—2 5 &n— 1 )*•*]) 

'-V-' '-v-' 

n —2 times n- 3 times 

where in the latter case a n -3 > 2. In the latter case, we also deduce that 

[ 1 ; 1 , 1 ,..., 1 , 1 , 2 ] = and [ 1 ; 1 , 1 ,..., 1 , 1 ] = ^ 

n- 3 times n —2 times 

both belong to £(£) which contradicts the fact that h n /h n -1 < </> < h n+ i/h n < h n -i/h n - 2 - This forces 
(5.10) ^ — [ 1 , 1, 1, . . . , 1, 1, U n —2j dn— 1, d n . . .] 

n —2 times 

and the result follows again by applying Theorem 1 2.3 1 □ 

Although it isn’t necessary for the proof of Theorem |4.4| JTT]) , we note that the continued fraction expansion 
for £ given in (5.10) must have a n _ 1 = 1. That is, we know that 


£ — [lj 1) 1) • • • ) 1) lj d n —Xi On, • ■ •]• 

V -V-' 

n— 1 times 

Otherwise, h n /h n -1 and h n+ i/h n must both belong to £(£) which contradicts our assumption from Theorem 

noun). 


We now establish the proof of our estimate on the size of V(a) when a satisfies Theorem 4.4 


Proof of Theorem \4.1\ By Theorem |4.4[ we conclude that a n has characteristic transformation 

T ( h-2 • • • h n+ 1 

“ \h 0 hx ••• h n 

Assuming x = (xi, X 2 , • • •, x n , x n j r x) T £ V(a„) we know that Xi < h n +x for all i. Otherwise, we would have 

Xxhx T X2I12 “t“ * * * Xfi^ln 4 “ *£n-{-l^'n+l ^ X^hi h n -\-Xi 

a contradiction. Hence, it follows that #V(a) < (h n+ 1 + 1)" +1 . In addition, using basic facts about the 
Fibonacci sequence, we find that h n+ x > 2"/ 2 so that 

2 log h n+ 1 


n < 


log 2 


These observations yield 


log(#V(an)) = (n + 1) \og{h n+ x + 1) < ( 2logh ™ +1 + 1 ) \og(h n+1 + 1), 


log 2 


and we obtain log(#V(a n )) <C (log/i„+i) 2 as required. 


Although it is somewhat less important for our applications, the use of convex hulls in Theorem 4.8 
help us to provide a small infimum set for p a /q b . We record the proof of this result here. 
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□ 

can 


























Proof of Theorem f.8 It is well-known that Conv(S') = Conv(Vert(S)) so that 


Conv(S’) = < c x • x : c x > 0 and c x = 1 > . 

^xEVert(S') x£Vert(5) J 

If y £ S \ Vert(S') we may assume that xi, x 2 ,..., x*, £ Vert(5) and ci, c 2 ,..., Cfc £ (0,1) are such that 

(5.11) y = CiX! + c 2 x 2 H-hc fe x fe and ci + c 2 +-|-c fe = l. 

For each positive real number t, we may select z t £ {xi,x 2 ,... ,Xfc} so that 
(5-12) /z t (t) =min{/ Xl (t),/ X2 (t),...,/ Xfc (t)}. 

To complete the proof of the theorem, it is sufficient to show that f y (t) > f Xt {t) for all t £ (0, oo). 

To see this, we write y = (j/i, y 2 , • ■ ■, j/jv) and Xj = (xn, X &,..., Xjjv) for all 1 < * < k and assume that a 
has characteristic transformation 

a\ a 2 • • • aj\r 
bi b 2 • • • bjy 


T n = 


N 


fy(ty = y " w ( fJT ) = mi c * x ™ m ( Tbf 


N k 


Now observe that 


n=l x 7 n=l i—l 

Reversing the order of summation on the right hand side yields 

k N / „ \ t k 




m 


2—1 n —1 






2—1 


Now applying (5.11) and (5.12), we obtain 

k 

f y (ty > J2 c if^(ty = 


2=1 


completing the proof. 


□ 
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